COORDINATE RESTRICTIONS OF LINEAR 
OPERATORS IN 



R.VERSHYNIN 

Abstract. This paper addresses the problem of improving prop- 
erties of a hnear operator u in I2 by restricting it onto coordinate 
subspaces. We discuss how to reduce the norm of u by a random 
coordinate restriction, how to approximate m by a random opera- 
tor with smaU "coordinate" rank, how to find coordinate subspaces 
where u is an isomorphism. The first problem in this list provides a 
probabihstic extension of a suppression theorem of B. Kashin and 
L. Tzafriri, the second one is a new look at a result of M. Rudelson 
on the random vectors in the isotropic position, the last one is the 
recent generalization of the Bourgain-Tzafriri's invertibility prin- 
ciple. The main point is that all the results are independent of n, 
the situation is instead controlled by the Hilbert- Schmidt norm of 
u. As an application, we provide an almost optimal solution to the 
problem of harmonic density in harmonic analysis, and a solution 
to the reconstruction problem for communication networks which 
deliver data with random losses. 



1. Introduction 

Linear operators in a finite dimensional Hilbert space H constitute 
one of the most fundamental classes of operators in Functional Analysis. 

Recall a classic observation for an arbitrary operator u on H. There 
can be found an orthonormal basis in H so that, up to an isometry, u is 
a diagonal operator with respect to that basis, and its diagonal entries 
Si > S2 > ■ ■ ■ > sj\f > satisfy ~ II^IIhs' where ||m||hs denotes 

the Hilbert-Schmidt norm of u. This provides enough information on 
how u acts with respect to the chosen coordinate structure on H. For 
instance, restricting u onto an appropriate coordinate subspace M"^ we 
can cut off large s-numbers Sj to improve the norm of m, or to cut 
off small s-numbers Sj to nicely approximate u by an operator with a 
smaller rank. 

But what if a coordinate structure on H already exists and has no 
relation to m? In other words, can one still improve the properties of a 
linear operator u in by restricting it onto coordinate subspaces? 
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As a first result of tliis paper, we will compute the norm of a random 
coordinate restriction of u. Precisely, we bound 

ElhUII (1) 

where cr is a random subset of {1, . . . , A^} of a fixed cardinality n < 
N. The history of the question is the following. A known theorem 
of M. Talagrand [[la 95 1 gives an upper estimate on (|l|) for a linear 



operator from I2 into a Banach space X (see |[la 98|| for a further 



generalization). However, the estimate of M. Talagrand is close to 
being sharp only in a certain, quite restrictive, range of n (needed in 
applications to the A(]9)-problem). After partial results of B. Kashin 
[Ka| | and A. Lunin |[Lu| ] , B. Kashin and L. Tzafriri | [Ka-'l'^| produced an 
argument which essentially proves an optimal estimate on the minimum 
of over all subsets a as above. It equals to 

where h = ||m||hs is assumed norm one (see 0). The crucial 



step in the proof of (H) is Grothendieck's factorization (see ||Le-Ta 
Proposition 15.11), which gives no information about a except that it 
can be found in a random subset of {1, . . . ,N} of cardinal, say, 2h. 
Note that (0) is equivalent to 

minJI^I^J <c(y^ + max||ue,||), (3) 

because by Chebyshev's inequality at least ^A^ numbers \\uej\\ do not 

exceed 2-^/^. We will prove that, 1 
estimate holds for a random set a. 



exceed 2-i/-^. We will prove that, up to a logarithmic factor, the same 



Proposition 1.1. Let u be a norm one linear operator in I2 ■ Consider 
an integer 1 < n < N . Then for random subset a of {1, . . . , A^} of 
cardinal Icrl = n 



E||M|jg<,|| < Clog?2^^^^ + max llwcj 



In particular, for the threshold dimension, \a\ = h, we have 

< Clog/i ■ max IImCjII. (4) 



Note that the dimension A^ plays no role in (^. Instead, the situation 
is completely controlled by the parameter h = \\u 



The key to the proof of Proposition |L1| is the non- commutative Khin 
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chine inequahty in the Schatten class C^, due to F. Lust-Piquard and 
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G. Pisier (see 0). Its usefulness for coordinate restrictions in was 
recognized by G. Pisier. This provided an alternative approach to a 
lemma of M. Rudelson (see also 0) previously proved by a delicate 
construction of a majorizing measure. We will use the following non- 
simmetric version of M. Rudelson's lemma, which also follows from the 
non-commutative Khinchine inequality. For a finite set of vectors Xj, rjj 
in R'^ 

e|| EjXj Uj < Cy'lognf max ■ || Uj ® yj 

El/2\ 
Xj Xj I (5) 

" i ^ 

where Ej are independent Bernoulli random variables with Probje^ = 
1} = Prob{£,- = -1} = |. 

Next, we will prove an "approximation" counterpart of Proposition 
p..l| . Recall a well known inequality for the approximation numbers of 
an operator m in 

an{u) < \ - (6) 
V n 

where a„ = inf — : rank-ui < n}. We will obtain a coordinate 
version of (P) by a different look on arguments of M. Rudelson . 

Theorem 1.2. Let u be a norm one linear operator in I2 , and h = 
WMI'hs- Then for any integer n > 1 there exists a diagonal operator A 
in I2 such that rankA < n and 



\u(A - id)u*\\ < CJlogn- \ -. 

V n 



(7) 



Examples show that both u and u* are needed in (^. A is a random 
diagonal operator whose entries are multiples of independent selectors. 
It depends only on the values of ||Mej||; the larger ||Mej|| is, the more 
likely the j-th entry of A is not zero. For such random operator A, (|^ 

holds with large probabihty. Namely, if we set e = C i/log n ■ \ - then 



Prob{||M(A - id)u*\\ > te} < 3exp(-t2). (8) 

The reader should note that the dimension of the space plays 
no role in this result as well as in Proposition p..l| , and the situation 
is again controlled by the parameter h = ||m||hs- This phenomenon 
seems quite general. It roughly says that for a linear operator u in 
I2 the Hilbert-Schmidt norm of u (an not the rank, for example) is 
responsible for the essential properties of u. Another instance of this 
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phenomenon is the recent extension of Bourgain-Tzafriri's principle of 
restricted invertibihty 0. 

Theorem 1.3. Let u be a norm one linear operator in and h = 
\\u\\jjg. Then for any e > there exists a subset a o/ {1, . . . , A^} of 
cardinal \a\ > {l — e)h so that the sequence {Tej)j^cr is C{e)- equivalent 
to an orthogonal basis. 



In other words, under the hypotheses of Theorem |L3| there exists 
an isomorphism T with ||T||||T~^|| < C{e) which takes Cj to uej/\\uej\\ 
for j G a. We see again that the parameter h = ||u||yg governs the 
situation, and the dimension N is unimportant. 

There is an important apphcation of Theorem L3 to "unbounded" 
operators u in Z^. It says that if the norms ||Mej|| are controlled, then 
M is a nice isomorphism on a large coordinate subspace. 

Corollary 1.4. Let u be a linear operator in such that \\uej\\ = 1 
for all j = 1, . . . , N. Then for any e > there exists a subset a of 
{1, . . . , A^} of cardinal \a\ > (1 — so that 

ciie)\\x\\ < \\ux\\ < C2(£)||x|| forxe (9) 

This is a direct generalization of a theorem of J. Bourgain and L. Tza- 
friri, who proved Corollary L4 for some < £ < 1 and with only the 
lower bound in (|). The upper bound is also nontrivial, as we do not 
assume the operator u to be well bounded. 

An application of Corollary lA to harmonic analysis generalizes re- 
sults on the problem of harmonic density P-Tz|| . Let T be the circle 
with the normalized Lebesgue measure z/, and i? is a subset of T of 
positive measure. The two norms naturally arise here, 

^ Ifl'du^' and = ( / \f\'duy^\ 



L2{B) 



u{B) 



B 



In general there is no relation between ||/||l2{-B) and ||/||l2(t)- However, 
suppose 5 is a half-circle; then it is easily seen that the two norms are 
equal for the functions / whose Fourier transform is supported by 2Z. 
Then (a rather vague) question is 

For what functions / are the two norms 
\l2[b) and ||/||l2(t) equivalent? 



More specifically, consider functions / whose Fourier transform / is 
supported by a fixed set of integers A. How dense can A be so that the 
two norms are still equivalent? 

This problem in a weaker form was stated by W. Schachermayer. His 
question was whether there exists a set A of integers such that / does 
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not vanish a.e. on B provided supp/ C A. Answering this question 
positively, J. Bourgain and L. Tzafriri proved that the existence of A 
with density cv{B) for which 



L2{B) 



>C||/|U.(T) (10) 



whenever supp/ C A. The proof rehes on the principle of restricted 
invertibility. The extension of this principle, Corollary |1.4| , can be 
used to prove the reverse inequality in ( p!0D and also to obtain a nearly 
optimal density (1 — e)v{B) of A. 

Theorem 1.5. Let B he a subset of T of positive Lehesgue measure, 
ande > 0. Then there exists a set of integers A with (two-sided) density 
densA > (1 — e)v{B) so that 

Cl(5)||/|U,(T)<||/|U.(B)<C2(£)||/|U.(T) (11) 

whenever the Fourier transform of f is supported by A. 

The two-sided density here is densA = lim„^oo l^^nhZ!£]dl_ This defi- 
nition seems to be more natural for subsets of Z than the usual "one- 
sided" definition. 



In Appendix, we return to Theorem |1.2| and discuss its rather un- 



expected application. Together with the concentration inequality 



Theorem |1.2| provides a solution to the reconstruction problem in com- 
munication systems such as the Internet which deliver data with ran- 
dom losses ||G-K|| . 



I am grateful to P. Casazza and to G. Schechtman for useful discus- 
sions. 

2. Suppressions on Coordinate Subspaces 

2.1. Result. In this section we will compute the norm of the random 
coordinate restriction Hitlg-tjH of an arbitrary linear operator u on . 
The size of a is a fixed integer n not necessarily equal h{= ||m||hs) ^ls 



in Proposition 1.1. 



The minimum of Iplj^^H over all subsets a of cardinal n is provided 
by Kashin-Tzafriri's theorem [ Ka-l'z| ] (see |]V| for a proof). 



Theorem 2.1. (B. Kashin, L. Tzafriri). Let u he a norm one linear 
operator in I2 , and h = ||m||hs- Then for any integer n > 1 there exists 
a subset a of {1, . . . , A^} of cardinal = n such that 
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We see that the treshold value for the size of a is h: for n < h the 

best restriction H^lj^^ || is bounded by ^J^, while for a larger size n the 

best bound is a/^- This is illustrated by an example in Section |3l^ , 
which yields that (p!2|) is sharp up to a constant. 



Before we state a probabilistic extension of Theorem let us spec- 
ify what we mean by a random subset a. There are several equivalent 
definitions, of which the following one seems more convenient through- 
out the present paper. Let A be a finite set and < 5 < 1. Consider 
a subset a of A by taking (rejecting) each element of A independently 
with probability 6 (respectively, 1 — 6). Then the cardinal of a is con- 
centrated around n = S\A\. We then call a a random subset of A of 
cardinal \a\ ~ n. 

Theorem 2.2. Let u be a norm one linear operator in Let M = 
maxj WuCjW . Then for any integer n > 1 and for a random subset a of 
{1, . . . , A^} of cardinal \a\ ~ n 



E\\u\ 



(13) 



The key to the proof is a non-simmetric version of M. Rudelson's 
lemma from |0 (see also 0]). 

Lemma 2.3. Let Xj, yj be a finite set of vectors in 

m 



E 



max 

j 



+ max 

j 



Xj I 



J2 ® yj 

Xj 



Then 

1/2 
1/2 



(14) 



This lemma is a consequence of the non- commutative Khinchine in- 
equality in the Schatten space C™, with optimal constant 0{y/p), for 
p = logm. This inequality is a result of F. Lust-Piquard and G. Pisier 
(see 0). 

Theorem 2.4. ( F. Lust-Piquard, G. Pisier). Assume 2 < p < oo. 
Then there is a constant Bp < Cy^ such that for any finite sequence 



(x,) zn c; 



R{X, 



< 



< Bp ■ R(^Xj 



where 



R{Xj 



1/2 



V 



1/2 
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Proof of Lemma |2.3| . Note that for p = log m 

< \\X\\c^ < e\\X\\. (15) 

Then we can apply Theorem O for Xj = Xj yj noting that X*Xj = 
WyjW^Xj Xj and XjX* = WxjW^yj yj- We get 



e|| EjXj ®yj 5^ ^j^j ® Vj 

3 j 



Xj 
1/2 



1/2 



By (|r^) we can replace both || • Hc-m-norms by || ■ || -norms, which easily 
leads to the completion of the proof. ■ 

Proof of Theorem |2.2| . For Xj = uej^ j = 1, . . . , TV we can write 
We need to compute 



6 j X j . 



jjCj 09 Xj 



E:=E\\u\^J=e\\J2Sj 

where 5j are {0, l}-valued independent random variables with MSj = 
5 = ^y^■ This will be done by a usual symmetrization and applying 

(14) twice; first to bound E in terms of -Ei = E "^SjXj Xj , and 

then again to compute Ei. 

Now we pass to a detailed proof. The standard symmetrization pro- 
cedure (see ||Le-Ta|] Lemma 6.3) yields 

^ < e|| ^(5j - 5)ej (g) +5\\u\\ 

j<N 

< 2EEe|| ^ejSjej 



Yy Xj 



+ 6. 



Then we apply (|I^) to bound Ee|| ^ ejCj i^jXj) . Clearly in (|T^) we 
can set m equal 



Ni6) :=ev(5^5, 
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Then using Cauchy-Schwartz and Jensen inequalities we obtain 

1/2 







J2 SjXj ® 








/ \ 1/2 

< C(\ogEN{6)] 












< C^/logSN{Ey^ + M) 


+ 5, 



1/2 N 2 

+ M 



1/2 



+ 5 



+ M 



+ 6 



(16) 



where 



Therefore the problem reduced to computing Ei. By the standard 
symmetrization and noting that || ® ^ill — ll^"'-**!! < 1 we have 



E^<E\\j2iSj-S) 



+ 6 
+ 6. 



We apply (0) again. 

El < C (Elog N {6)y^^ ■ M ■ (E^I^^SjXj 



Xj 



j<N 



1/2 



+ 6 



< C^/\og6N ■ M ■ eI'"^ + 5. 
It follows that 

< C{^\og5N ■ M + V~5). 
Combining this estimate with (|T6|) we obtain 

E < C'v/loglA^(v/loglAr-M + 75). 
This completes the proof. 



2.2. Dimension Independent Corollary. Let us take a closer look 
at (|13|). If we disregard for a moment the distracting logarithmic terms 
we will see the point of this estimate. It essentially says that the norm 
of u is bounded on most coordinate subspaces of dimension n by the 
maximum of and M = maxj ||ttej||. The size of M is a natural 
obstacle here; the only a priori information about M is that 



(17) 
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where, as usual, h = ||u||hs- However, the upper bound for M can be 
reduced to essentially as explained in the introduction. 

So, the new information (|13D gives is that the optimal estimate of 
Kashin and Tzafriri on the minimal suppression lli^lj^all essentially 
holds for most subsets a. 

Taking n = h log h, we obtain through (p!7|) 



E\\u\^^\\<Cy^U^^^ + ^yi^M^ =C\ogh- 



and this estimate seems to be sharp up to a constant. We single it out 
as a corollary. 

Corollary 2.5. Let u be a norm one linear operator in I2 , and h = 
\\u\\jjg. Then for a random subset a of {1, . . . ,N} of cardinal \a\ ~ 
h\og h 

EIImL^II < Clog/i ■ max ||Meo||. (18) 

j 

Quite remarkably, the dimension N is not important here. We ob- 
tained a probabilistic version of Kashin- Tzafriri's estimate 

-i-JhLH^/l (19) 



since M = maxj ||Mej|| can be easily reduced to cy ^ as explained 
above. 



How sharp is Theorem |2.2[ ? It can be shown (considering the example 
from Section p.2|) that the first logarithmic term in (p!3D is necessary, 
and therefore (|13]) is sharp up to the second logarithmic factor (before 
M). This shows in particular that Theorem |2.2| is sharp for large n. 

3. Approximation on Coordinate Subspaces 



3.1. Result. In the present section we will prove Theorem p..2| and the 
■?/'2-estimate (H) . To highlight the diension independentness in Theorem 



1.2, we state it as an infinite dimensional result. 



Theorem 3.1. Let u be a norm one linear operator in I2, and h = 
W'^W'hs ^ Then for any integer n > 1 there exists a diagonal 

operator A in I2 such that rankA < n and 



|u(A - id)u*\\ < CJXogn ■ \ -. 

V n 



Setting C{e) = Ce ^log(l/e), we obtain an "approximation" coun- 
terpart of Corollary |275|. 
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Corollary 3.2. Let u be a linear operator in I2 with ||n|| = 1 and 
h = II M 111^5 < 00. Then for any e > there exists a diagonal operator 
A in I2 such that 

||m(A - id)u*\\ < e, (20) 

and rankA < C{e)h(l + \ogh). 

This result can be viewed as a coordinate version of the basic in- 
equality for the approximation numbers of u which follows from (^), 

ac{e)h{u) < e 

where c{e) = 



The diagonal operator A is random, and its diagonal entries A(j) 
can be easily described. Let K = n/h. 

• If WuCjW = then we set A(j) = 0. 

• If fr||uejp > 1 then we set A(j) = 1. 

• If < f^llMejll^ < 1 then A(j) is a random variable independent 
of the other entries and distributed as 

Prob{A(j) = -j^u^} = 1 - Prob{A(j) = O} = K\\ue,f. (21) 



Proof of Theorem |3.1| . The proof requires just a different look at 
the argument of M. Rudelson [0. The key step is the application of 
Lemma |2]^ for Xj = yj = ucj. 

We are going to prove for the random operator A that 

E := E||m(A - id)u*\\ < e. (22) 

First note that A has finite rank with probability 1, because by the 
Chebyshev inequality 



ErankA < \{] : KWuCjW^ > l}\ + ^K\ 

< Kh + Kh< 00. (23) 

This observation allows us to concentrate only on finite-dimensional 
operators u. To make this claim precise, put un = uPn, where Pn 
is the coordinate projection in I2 onto M^. Let A^r be the diagonal 
operator defined as above for the operator un- Then A^v = AP/v = ^ 
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with probability— 1 as 00. Thus 

u{A — id)u* — UAr(Ajv — id)u*j^ = u{A — id)u* — 

- uPn{APn - id)PNU* 
= {uAu* - uAPnu*) + 

+ {uu* — uPjyU*) 
= UU* — uPnU* 

with probabihty — > 1 as ^ 00. The norm of this operator vanishes as 
N ^ 00 because ||m||hs < 00. This shows that the difference between 
\\u{A — id)u*\\ and \\un{An — id)u*^\\ is at most En, with probabihty 
at least 1 — en, where ^tv— ^-OasA^^cxD. Therefore we can assume 
in ( p2D that u acts in a finite dimensional space 

As we already saw in (pSf), the operator A defined has the required 
rank 



ErankA < 2Kh = 2n 
(the factor 2 is of course not important). Let 



(24) 



UCj, J 



N. 



Then we can write 



N 



UU 



so 



E 



E 



J2 ® X 



UU 



(25) 



At this point we can assume that 



< -ft'llsjlp < 1 for all j = 1, 



Indeed, if f^UxjlP is either or 1, then by the construction the sum- 
mand (A(j) — 1) xj (S> xj vanishes and contributes nothing to the sum 
in (pSf) . Therefore, we can assume that in (|25|) all A(j)'s are indepen- 
dent random variables with distribution (pT]). Since E(A(j) — 1) = 
for each j, we can apply the standard symmetrization procedure (see 
Lemma 6.3) which gives 



ILe-'ia 



E < 2EeJ| ^EjA{j)xj 



® X. 



12 R.VERSHYNIN 

where the expectation is taken according to the Rademacher vari- 
ables Ej. 

To bound the latter expectation, we fix a realization of A and apply 
Lemma p.3| for Xj = yj = A(j)^/^Xj, j = 1, . . . , A^. The number of 
non-zero terms in this sequence is at most rank A, so we can assume 
that m in the lemma equals 

Rank A := e V rank A. 

We obtain 

1/2 



E<CE 
Note that 



(logRankA)^/^ ■ (^maxA{jY/^\\xj\\^ ■ || ^A{j)xj ® 



maxAOy/^llxJ = R-^^^. 



Then by the Cauchy-Schwartz inequality, Jensen's inequality and using 
II Yl!j=i ® ^i|| = ll""^*!! < 1 we obtain through (|2^) 



1/2 ^ 

E < Cis:"i/2(^ElogRankA) (^E ^ A(j>j- 



1/2 

' '- -1/2 



1/2 



< Cir"^/^(^logERankA) (E + 1)^^ 
<Cir-i/2(logn)V2(E + 1)1/2 

= C^/I^\l^iE + iy/^ (26) 
V n 



We can assume that Cv^logny ^ < 1, otherwise the conclusions of the 
theorem hold simply with A = 0. Then (120) implies 



E < CJ\ogn\ -. 

V n 



Hence the conclusions of the theorem hold with non-zero probability. 
This completes the proof. ■ 



3.2. Remarks about the form of approximation. Our first obser- 
vation is that both u and u* are needed in (pOD, whatever the norm 
||m||hs is. To see this, consider the following example essentially bor- 
We consider two positive integers h and k and set 



rowed from Ta 95 
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= hk. We define an operator u in "blockwise" by its action on 
the coordinate basis: 



k vectors k vectors k vectors 



V 

A'' vectors 



In other words, we divide {1, . . . , A^} into h sets Ai^ I < h, of cardi- 
nal /c, and for j G Ai we set uej = Sij^k. Then obviously = 1 
and ||m||hs — ^- Let A be an arbitrary diagonal operator in with 
rankA = n, where n = C{e)h\ogh. Consider a = 1 < i < 
N, A{i) = 0}. Since |cr| > N — n, there exists an integer / < h 
such that 

N -n 
h 

Notice that for j G cr fl all vectors u{id — A)ej equal the same vector 
ci I \/k. This implies that 



, AM, iW^Al N-n N-n 

If k is chosen large enough, then the ratio = ^ is small. In 

particular 

\\{id-A)u*\\ = \\u{id-A)\\ > 1/2. 
This contradicts (|20|) . 



This example can also be used to show that Theorem ^]T| is sharp 
up to a constant, and that Theorem ^]2| is sharp up to the second 
logarithmic factor (we leave this to the interested reader). 



A comment is in order about the form of the operator A. One is 
tempted to say that A should look like a projection, i.e. A = aP for 
some number a and a coordinate projection P. This is not always true 
(again whatever the norm ||m||hs is), as the following modification of 
the previous example shows. We again consider two positive integers 
h and k, but set A^ — 1 = hk. We define an operator u in similarly 
to the previous example. Namely, we divide {1, . . . , A^ — 1} into h sets 
Ai, I < h, of cardinal k, and for j G Ai we set 

Xj = UCj = Ell Vk. 

Finally, set 

uejsi = Ch+i- 
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Note that 
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uu 



^ (g) + e„+i (g) Ch+l 



i<N-l 



j<h+i 



(27) 



Then ||-u|| = 1 and ||m|||^5' = h + 1. Let n = C{e)h\ogh. 

Now assume that there exists a number a and a set a C {1, 2, ... , N}, 
\(x\ < n, so that 



\u{aPa — id)u*\\ < e, 



(28) 



where P„ is the coordinate projection onto Mf^, and M = C{e)hlogh. 
We want to get a contradiction. To this end, note that the term Ch+i ® 
e/i+i must be present in the expansion of uPaU* - in other words, u 
must contain A^. Indeed, otherwise 

{auP^u*)eh+i = 0, 

although 

\\{uu*)eh+i\\ = 1. 

This will contradict (p^). 

Next put u = a \ {N}. Then, with rrij = \Aj fl z/|, j = 1, . . . , h we 
can write 

uP^u* = ^ (g) Xi j + eh+1 ® Ch+i 



Then by (^3) have 
auPaU* — uu* 

E 



Tfl ■ 



+ (a - l)e/,+i ® e^+i. (29) 



This is a diagonal operator, so its norm equals 



max 
j=i,...,h 





1 




{ 


a—^ — 1 


, \a - 1|| 




k 





Since by ( pS]) this norm must be less than e, we have a < 1 + e. On 
the other hand, 

= |z^| < n. 



COORDINATE RESTRICTIONS OF LINEAR OPERATORS IN K 



15 



Therefore there exists a j < h so that mj < n/h. Then 

n 



rrij 
a-^ - 1 




rrij 


> 1 - 




k 







>1-(1 



eja- 



n 



hk ' 'N 

If0<£:<l/2 and h is essentially smaller than A^, then n/N is close 
to 0, making the norm of operator in (^9]) close to 1. This contradicts 



(i 



3.3. Tail probabilities. As a natural strengthening of Theorem |3.1| , 
we will compute the tail probability 

Prob{||M(A - td)u*\\ > t}. 

Since the operator uu* which we are approximating in Theorem |3.1| has 
norm at most one, the interesting range for t is < t < 1. 

Proposition 3.3. For the random diagonal operator A in Theorem 
letting 



(30) 



we have 

Prob{||M(A - zd)u*\\ > t} < 3 exp - 
for alio <t < 1. 

This estimate is a consequence of the following lemma ( ||Le-Ta|| Lemma 
3.7), which can be easily proved via expansion of the exponential func- 
tion. Recall the definition of the ?/^p-norm of a random variable Z, for 
p > 1: 



inf {A > : Eexp(Z/A)P < e}. 



Lemma 3.4. Let Z be a positive random variable, and d be an integer. 
Then the following are equivalent: 
(i) \\Z\\p < allp> 1. 



(u)\\Z\ 



i'2/d 



< Co 



The constants 6*1,6*2 only depend on each other. 

This observation reduces our problem to computing the p-th moment 
of ||'u(A — id)u*\\. To this end note that the proof of Lemma ^]3| gives 

an estimate on the p-th moment of || SjXj ®yj . In particular case, 

for Xj = yj, we have 

Lemma 3.5. (M. Rudelson) Let {yj) be a finite set of vectors in M™. 
Then for p > 1 



( II ^\ 

(^^11 ^^y-j ) < c*(p V log m) 



1/2 



max 
i 



1/2 
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Lemma 3.6. Let Z is a positive random variable with \\Z\\^-^ > 1. 
Then 

\\Z\\p < Cplog(EexpZ) 

for all p > 1 . 

Proof. Let M = \\Z\\^^ > 1, then 

Eexp(Z/M) = e. 
By Lemma p.4| , \\Z/M\\p < Cp. Then by Jensen's inequahty 
\\Z\\p < CpM = CpM\og{Eexp{Z/M)) 

Cp\og fEexp(Z/M) 
< Cplog(EexpZ). 



Proof of Proposition |3.3| . Let 

Z = u{A - id)u*. 

Then (^0|) is equivalent to 

(ll^l|Al)^,<e. 



(31) 



By Lemma |3.4| there exists a constant c > such that (|3l|) is implied 

by 

(||Z|| A < c£j9^/2 for all p > 1. 

Note that {\\Z\\ A l)p < \\Z\\pA 1. Then, with Ep = \\Z\\p, it suffices to 
show that 



Ep Al< cep^/^ 



(32) 



for all p > 1. Similarly to the proof of Theorem |0 



N 



Ep<2 f EEJ 2^ ejA(j>j ®Xj \ . 



Applying Lemma |3.5| in the same context as before we get 

\P/2 

• K-''^^ ■ II 

3<^ 



Ep < C \ E 



(p V log Rank a)"^^ • R-p^"^ 



p/2 



1/p 



E(p Vlog Rank A j 



l/2p 



Xj Qv Xj 



l/2p 



(33) 
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To compute the first expectation we use Lemma pTq for Z = log RankA. 
Since RankA > e by the definition, > 1. Therefore 

l/2p 



E hoV log RankA 



a/2 



< C(p + plogERankA)^/2 

/ \ 1/2 

< C(p\og{Kh)] 

as in the proof of Theorem |3.1|. As for the second expectation in 



l/2p 



< {Ep + 1) 



Therefore, recalling that K = n/h we obtain 

E, < CoV^^p'^\E, + lY/' 
< ic/lO)ep'/'-iE, + lY/^ 



1/2 



(34) 



provided the constant Cq in the definition of e is large enough. 

Now, if (c/10)£:p^/^ > 1 then certainly (|32D is true. So we can assume 
that (c/10)£p^/^ < 1. Then (||) yields 

Ep < 2{c/10)ep^/^ < cep^l'^, 



which again implies (|32D . This proves (pTD and therefore completes the 
proof. ■ 



4. Isomorphisms on Coordinate Subspaces 

4.1. Result. In this section we will discuss the extension of Bourgain- 
Tzafriri's principle of restricted invertibility. Theorem |1.4| and Corollary 



L3i and its relation to the problem of harmonic density. 

For the proof of Theorem |1.4| we refer to 0. Note that it implies 
Corollary [L^ by homogeneity. 

One pleasant thing about Corollary |1.3| is that with some fixed e 
it can be deduced directly from Kashin-Tzafriri's suppression estimate 
(p!9|) and the original Bourgain-Tzafriri's theorem P-Tz|| , which we 



recall now. 

Theorem 4.1. (J. Bourgain, L. Tzafriri) Let u be a linear operator 
I2 such that \\uej\ 



in I2 such that llueJ = 1 for all j = 1, . . . , A^. Then there exists a 



subset a 0/ {1, . . . , A^} of cardinal \cr\ > Cpp- so that 

\\ux\\ > c\\x\\ for X G M." . 
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To deduce Corollary |L^, first apply Kashin-Tzafriri's suppression 
([T9|) to the operator u. Note that u is not necessarily norm one, so by 
homogeneity we obtain a subset ai of {1, . . . , N} of size = A^/H-up 
so that the norm of u on ]R°"^ is bounded by an absolute constant. Next 
apply Theorem |4.1| to u restricted to M*^^ . There exists a subset cr of ai 
of cardinality proportional to |cri| so that \\ux\\ > c\\x\\ for all x G M°". 
Hence Corollary |1.3| is proved (for some < £ < 1). 

The importance of the upper bound in Corollary |1.3| can be best 
illustrated by the following example, which links this theme to har- 
monic analysis. Let T be the unit circle with the normalized Lebesgue 
measure u. Consider an arc 5 C T, 

B = {e\ < t < 27r6}, 

and assume for simplicity that viS) = 6 is the inverse of a positive 
integer. Let Pb be the restriction onto B: 

Psf = fXB. 

How does Pb act on the natural "coordinate" structure generated by 
the characters e'*^*, k & If! P maps the characters to vectors of norm 
in L2(T). Therefore, if b is small then Pb is far from being an iso- 
morphism on any "coordinate" subspace generated by the characters. 
On the other hand, an easy integration shows that the vectors 



are orthogonal. Therefore, on the subspace of L2{T) generated by the 
characters 

the projection Pb is a multiple of an identity. 

The finite-dimensional counterpart of this situation is captured by 
Corollary pT3| . Applied to the operator u = -^Pb, it guarantees that 
the set of the first n characters contains a subset of size almost bn, on 
which Pb acts like an isomorphism. The infinite-dimensional extension 
of this result is the subject of the next section. 



4.2. Harmonic Density Problem. Our aim is to prove Theorem 
L5 . Following B-Tz|| , the proof will consist two steps. First, the 
invertibility result. Corollary |L^, implies a finite-dimensional version 
of Theorem p..5|. Next we apply a combinatorial result of I. Z. Ruzsa 



|Rus|| , which allows to pass from large finite sets to a needed infinite 



set A of large density. 
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Definition 4.2. Let Ti be a set of finite sets of integers. Ti is called 
a homogeneous system if for every A ^Ti, all the subsets and transla- 
tions of A belong to Ti. 

Given a homogeneous system 7i, there exists a hmit 

M-rj^ V l-4n[l,n]| 

diri) = hm max -. 35) 

n^oo Ae-H n 

(note that the sequence under the hmit is non-increasing). 
Theorem 4.3. (I. Z. Ruzsa ||Rus|] j. Given an arbitrary homogeneous 



system Ti, there exists a sequence of integers A such that its finite 
subsets all belong to Ti and 

densA = d{n). 

In |Ru£] this theorem was proved for the one-sided density of A, i.e. 



for hm„^oo i^^^^^jii^. However in our case it seems more natural to work 
with the two-sided density 

|An[-n,n]| 
dens A = hm 

n^oo 2n 

Our first buisness wiU be to prove Ruzsa' s Theorem [4.3| for the two- 



sided density. This requires only a slight modification of the original 
argument. At the first step, we find arbitrarily large finite subsets in Ti. 
with optimal "hereditary density". This is the content of the following 
lemma from |[l{.us|| , which we include without proof. 



Lemma 4.4. (I. Z. Ruzsa). Let Ti be a homogeneous system. For 
arbitrary n there exists a set A & H, A C [l,n] satisfying 

\An[i,k]\ 



k 



> d{n) for alll<k<n. 



Proof of Theorem |4.3| . By the homogeneity of Ti,, there also exists 

, A C [— n, n] satisfying 

> d(n) for all 1 < A; < n. (36) 



for arbitrary n a set A G H, A C [— n, n] satisfying 

\An[-k,k]\ 



2k 

We define an ordered tree G. Its n'th level Gn consists of all the sets 
A E H, A C [—n,n] satisfying (p^). A vertice A G Gn is connected to 
a vertice B G Gn+i if A C B . This describes all the edges of G. 

Then the graph G is infinite, joined (since every A G Gn+i is con- 
nected to A n [— n, n] G Gn), and every vertex of G has only finitely 
many edges going to it. Then by Konig's Infinity Lemma (see e.g. 0) 
there exists an infinite path in G. So, let {An)n>i be a chain in G: 

■^n ^ Gn, An C ^n+l; = 1, 2, . . . 
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Then we claim that the set 

A = \jAr 



satisfies the conclusions of Theorem |4.3| . Indeed, all finite subsets of 
A belong to Ti by the construction. To show that the density of A is 
d{T-C), note that 

\An \-n,n\\ \AJ „^ 

2n - 2n - ^ ' 
from which it follows that the lower two-sided density densA > diTi). 
Similarly, since the sets A fl [— n, n] belong Ti and by homogeneity 

|An[-n,d| Un[-n,n]| Un[l,2n + ll| 

< max = max ■ 



2n ~ Aen 2n agh 2n 

Passing here to the upper limit as n — >■ cxd, we conclude that the upper 
two-sided density densA < d(H). (note that in ([35| ) the limit exists). 
This completes the proof of Theorem |4.3| . ■ 

Proof of Theorem |1.5| . Define an operator u on L2{T) as 



B- 



Note that = v{By^/^ and ||u(e^^*)|| = 1 for all k e Z. Then we 
apply Corollary pTS] . For every positive integer n, we get a subset 

cr„ C {1, . . . ,n} 



of cardinality 
for which 



|o"n| ^ (1 — £)v{B)n 



Cl(5)||/1U.(T) < ^{B)-'"\\f\\L,m < C2(e)||/|U.(T) 

whenever the Fourier transform of / is supported by a^. The middle 
part of this inequality is ||/||l2{-B)- Therefore we get: 

Equivalence (pT]) holds whenever supp/ C an- 

Consider the family Ti. of all finite subsets a of the integers such that the 
equivalence (|lT]) holds whenever the Fourier transform of / is supported 
by a. In particular, all sets cr„ belong to Ti. Clearly, the family Ti. is 
homogeneous. Since an ^Ti., 

d{n) > limsup ^ > (1 - e)iy{B). 

n— >oo IT' 

Then Ruzsa's Theorem yields the existence of a set A of integers 
whose all finite subsets belong to H, and with two-sided density 

densA > {l-e)u{B). 

This completes the proof in view of the definition of Ti. ■ 
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To see how sharp Theorem |1.5| is, let us look again at the example 
in Section |4.1| . We consider an arc 

B = {e**, < t < 2nb}, 

as a subset of T, where i'{B) = b is the inverse of a positive integer. 
We will show that if a set of integers A has two-sided density exceeding 
1/(5), then there exists a function / with supp/ C A and for which 
([TT|) fails; more precisely 

||/|U,(T) = 1 and ||/1U.(B)<« (37) 

where a can be chosen arbitrarily small. 

Tu put this differently, (|37D means that the sequence {e*^*, /c G A} is 
not a Riesz basis in L2{B). Assume the opposite. Then {e*'^*, A; G A} 
is equivalent to the canonical basis in I2, i.e. 

for all finite sets of scalars (a^). By change of variable. 



showing that for {A^} := 6A, the sequence of exponentials (e*'^'=*) is 
equivalent in L2(T) to the canonical basis of l2- Then we apply a 
classical result of N. Levinson on completeness of exponentials in Lp(T), 
see IP^evjl Appendix III.l or ||You| ] 3.2. Let n{r) denote the number of 



points Afc inside the disc \z\ < r, and we put 

t 

Theorem 4.5. (N. Levinson). The set (e*'^'=*) is complete in Lp(T) 
whenever 



lim sup (N{r) — 2r H — log 



r I > cxD. 

P 



In our setting the ration approaches IdensA > 1 as — > 00. 



2t 



Thus we have N{r) > 2r for r sufficiently large. Then by Theorem 45 
the system (e*'^'=*) is complete in L2(T). 

Note that this argument holds also if we remove a finite number of 
elements from (e*'*''=*), so that this system remains complete after the 
removal. This clearly contradicts to its equivalence to the canonical 
basis in 12- This finishes the proof. 

Remark. The use of the result of N. Levinson was suggested to me 
by V. Kadets. 
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5. APPENDIX. Application to Communication Systems 

We apply results of Section ^ to provide optimal estimates for a 
communication system which delivers data with random losses ||G-K| . 



A typical case we have in mind here is the Internet. A requested 
information is sent to a user in a sequence of " data packets" . If a data 
packet is lost on its way to the user, a protocol detects the missing 
packet and sends it again. However, the detection of the lost packet 
usually takes much more time than a successful delivery. This is the 
main source of large delays known to all network users. Therefore, 
instead of retransmitting the lost packets it is highly desirable to be 
able to recover the sent information using whatever received, despite 
the loss of some packets. The question is then how to distribute the 
source information among data packets? There should be some depen- 
dency between the packets, otherwise the information contained in the 
missing packets is irrevocably lost. 

Parallel to the development of wavelets and connected with it, there 
has arisen a simple but fruitful idea to represent information, viewed 
as a vector x in R"^, by its expansion through an identity 

id = Xj ® Xj (38) 
j<k 



for suitable vectors Xj G M™. These vectors are called a frame |Pa |. 
(More generally, a frame is a set of vectors xj for which ^ xj (S> xj is 
an isomorphism in l'^). Clearly, k > m. 

This way, a source vector x in M"^ is represented by k data packets 
- coefficients {xj,x), j = 1, . . . ,k, which carry complete information 
about X due to the reconstruction formula 

If k > m then this information is redundant; there is a kind of de- 
pendency between the packets. This way of representing x is often 
more resiliant to errors than the old method - expanding x using an 



orthonormal basis and transmitting each coefficient k/m times [Da|. 

A problem raised in ||G-K|| was: is this new method also resiliant to 
random losses of the packages? Specifically, if a random (but not too 
large) subset of the packages {xj,x), j = 1, . . . ,k is lost on the way 
to a user, can one essentially recover x by summing in (^) only the 
successfully delivered components? 

To make this scheme work with probability at least 1/2, the norms 
\\xj\\ have to be reasonably small, otherwise the contribution of the 
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summands in (^) can be too irregular; one can easily produce examples 
making this intuitive statement precise. 

Under this assumption the results of Section |^ imply that if at least 
C{e)mlogm packets are successfully delivered, then with high proba- 
bility the source vector x can be reconstructed by (pQ]) with precision 
e. The two main points here are that 

• The required number of successfully delivered packets does not 
depend on k; 

• no information is needed about the lost packets. 

As for the mentioned restriction on the norms ||a;j||, we will assume 
for simplicity they are equal to each other (and therefore to \/^)', a 
more general case requires only minor changes. 

Theorem 5.1. Consider a set of vectors xi, ... ,Xk in with equal 
norms and which satisfy (|38|). Let a be a random subset of {1, ... ,k} 
with cardinal \a\ ~ n (i.e. each element of {1,... ,k} is taken or 
rejected independently with probability n/k). Then for < t < 1 

id — - — j- Xj ® Xj 

jeer 

with probability at least 1 — 6 exp ( — t^/^^) , where e = C \/\og n ■ 



< t (40) 



Proof. Note that ||a;j|| = for all j. We will apply Theorem |3[1 
to the operator u : I2 ^ defined by 

It can easily be seen that 



Also 



\u\\ = 1 and h = ||m||hs — 



uu* = ''^^Xj (g) Xj = id, uAu* = A{j)xj ® Xj, 



3 3 
where A(j) denotes the j-th diagonal entry of A. Recall that, by the 
construction of A, in our case A(j) is a random variable independent 
of the other entries of A and distributed as 

Prob|A(j) = - j = 1 - Prob{A(j) = O} = ^. 

^ ft J rC 

Then 

k \ ^ 

\\u{A. — id)u*\\ = id (8) (41) 

jeer 

where a = {j : j < k, A(j) 7^ 0} is a random subset of {1, . . . , A^}, as 
described in the assumption of the theorem. 
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Then Theorem |3.1| and Proposition |3.3| give the needed probabihty 
of (^OD, but with |cr| replaced by n in the denominator. (However, 
we do not want n in the reconstruction formula since a priori n is 
not known). To complete the proof we note that by a standard con- 
centration inequality \a\ is close to n with high probability. Indeed, 
|cr| is a sum of k independent {0, l}-valued random variables 5j with 
5 := K6j = |. Then it follows from the classical bounds on the binomial 



law |P3o|| that for s < 26k 

1 < PVT. /'^^^^ ^ ^ 8n 



Prob{||a| - n| > 4 < exp ) < exp ( 



Then for s = tn 

Probj — 
I n 

This justifies the replacement of |cr| hy n and therefore completes the 



1 



> t [> < exp ( - — j < exp - - 



proof. 
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